Virtual incremental storage apparatus method and system

ABSTRACT

An apparatus for managing incremental storage includes a storage pool management module that allocates storage volumes to a virtual volume. Also included is an incremental log corresponding to the virtual volume, which maps virtual addresses to storage addresses. The apparatus may also include a replication module that sends replicated data to the virtual volume and a policy management module that determines allocation criteria for the storage pool management module. In one embodiment, the incremental log includes a lookup table that translates read and write requests to physical addresses on storage volumes within the virtual volume. The replicated data may include incremental snapshot data corresponding to one or more primary volumes. The various embodiments of the virtual incremental storage apparatus, method, and system facilitate dynamic adjustment of the storage capacity of the virtual volume to accommodate changing amounts of storage utilization.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No.10/713,445 filed on Nov. 14, 2003, now U.S. Pat. No. 7,412,583 entitledVIRTUAL INCREMENTAL STORAGE APPARATUS METHOD AND SYSTEM, and claims allrights of priority and incorporation available due to this continuationrelationship.

BACKGROUND OF THE INVENTION

1. Field

The invention relates to devices, methods, and systems for storing andreplicating data. Specifically, the invention relates to devices,methods, and systems for dynamically sizing a target volume.

2. The Relevant Art

Data processing systems are often required to copy large amounts of datato secondary storage or another primary data store. Historically,copying entire volumes of data required that a system suspend all tasksthat may access a source or a target storage volume. Suspension of thevarious tasks or processes to conduct the copy operation greatly reducedthe performance of such systems.

Incremental backup techniques have been developed to lessen theprocessing burden and any stoppage time associated with data archivingoperations. Incremental backup methods are typically conducted bysoftware that monitors or detects storage volume changes at a filelevel. For example, an incremental backup process running on the hostsystem may be required to scan all of the files within the file systemin order to ascertain which blocks of data on the storage subsystem havechanged since the last backup. To perform an incremental backup, anyfiles that have been modified since the last backup are copied to astorage volume during an incremental backup session.

One disadvantage of scanning file changes on the host is the processingburden associated with detecting changes. However, in one prior art filesystem, incremental changes are inherently tracked by the log-basedinternal structure of the file system. FIG. 1 is a schematic diagramillustrating a log-based storage system 100 illustrating the need forthe present invention. The depicted log-based storage system 100includes buses 102, 104, 106, and 108, a host 110, a primary volume 120,a replication module 130, an incremental log 140, a storage volume 150,a read module 160, and a baseline volume 170. The storage system 100uses log-based incremental backup techniques to replicate data stored onthe primary volume 120.

The host 110 facilitates communication among the various components ofthe storage system 100. The host 110 communicates with the primaryvolume 120, the replication module 130, and the baseline volume 170through buses 102, 104, and 106 respectively. The host 110 requests datafrom and sends data to the primary volume 120 via the file system 112,which translates operating system calls to operations on the primaryvolume 120.

During backup operations, the host 110 may also send one or morebaseline images to the baseline volume 170. For example, the host maysend a baseline image of the primary volume 120, corresponding to areplicated image of the primary volume 120 at a particular point intime. Subsequently, the replication module 130 may copy incrementalchanges made on the primary volume 120 to the storage volume 150.Together, the incremental changes and the baseline image are used duringrecovery operations to restore the primary volume 120 to a previousstate. In addition, the incremental changes may be accessed at any timethrough the read module 160.

In the depicted embodiment, replication operations conducted by thereplication module 130 are facilitated by the incremental log 140, whichtranslates read and write requests to physical addresses on the storagevolume 150. The incremental log 140 generally includes a lookup table orsimilar data structure that facilitates translating read and writerequests to physical addresses. In general, the lookup table or similardata structure is a mapping of a virtual address to a block address onthe storage volume 150.

One problem with prior art storage systems, such as the prior artstorage system 100, is selecting a proper storage capacity for thestorage volume 150. In particular, the storage capacity required forincremental changes is highly application dependent. Selecting acapacity that is too large for the incremental changes results in wastedstorage space. Selecting a capacity that is too small for theincremental changes could result in replication failure, loss of dataredundancy, and vulnerability to failures. In order to maintain dataredundancy under a wide variety of conditions, administrators typicallyerr on the side of reserving too much capacity, resulting in wastedstorage space that could be used for other applications.

Accordingly, what is needed is a storage volume with a dynamic capacitythat prevents replication operation failure and wasted storage space. Inparticular, what is needed are apparatus, methods, and systems fordynamically adjusting the storage capacity of a storage volume inresponse to increasing or decreasing storage space requirements.

BRIEF SUMMARY OF THE INVENTION

The present invention has been developed in response to the presentstate of the art, and in particular, in response to the problems andneeds in the art that have not yet been fully solved by currentlyavailable storage systems. Accordingly, the present invention has beendeveloped to provide a method, apparatus, and system for dynamicallyadjusting the storage capacity of a storage volume.

In a first aspect of the present invention, an apparatus for managingincremental storage includes a storage pool management module thatallocates storage volumes to a virtual volume and an incremental logcorresponding to the virtual volume that maps virtual addresses tostorage addresses. Due to the unique configuration of the storage poolmanagement module, the storage capacity of the virtual volume can bedynamically adjusted to accommodate high or low levels of storage use.

The storage pool management module may monitor available space on thestorage volumes. In one embodiment, the storage pool management moduleallocates an additional storage volume to a virtual volume in responseto observing a reduction in available space on an initial storagevolume. The storage pool management module can subsequently adjust thenumber of volumes assigned to the virtual volume, in response tochanging storage requirements for the virtual volume.

The storage volumes of the present invention may include at least onevolume corresponding to a redundant set of storage devices.Alternatively, a storage device may contain multiple storage volumes. Inone embodiment, the redundant set of storage devices may be a RAID arrayof storage volumes, or the like. Using a RAID array of storage volumesallows for high performance reading and writing, data redundancy, andmay reduce the complexity of implementing the virtual volume.

In certain embodiments that use a RAID array, the incremental log of thepresent invention may include a stripe map that facilitates mapping avirtual address to a storage address on the RAID array. In oneparticular embodiment, the storage address includes a volume identifiersuch as a LUN (logical unit number) and a block identifier such as anLBA (logical block identifier). The volume and block identifiers pointto a physical location corresponding to data written on the storagevolumes.

The incremental log of the present invention may include a lookup table.The incremental log appends a storage address mapping to the lookuptable in response to a write operation. In addition, a read module canaccess data written to the virtual volume by sending a read request tothe lookup table, which translates the request to a physical location onthe storage volumes.

In the aforementioned embodiments, the incremental log of the presentinvention provides a simple, effective data structure for reading andwriting to multiple storage volumes within a single virtual volume. Inaddition, the incremental log may be configured to translate read andwrite requests to multiple virtual volumes, where each virtual volumeincludes a variable number of storage volumes.

In another aspect of the present invention, a method for managingincremental storage includes allocating storage volumes to a virtualvolume and appending a storage address mapping to an incremental logcorresponding to the virtual volume. The method of the present inventionconserves storage space by allocating storage volumes to the virtualvolume on an as-needed basis.

The method may further include monitoring storage availability on thestorage volumes. In response to observing reduced space on an initialstorage volume, the method assigns an additional storage volume to thevirtual volume. The method continues to assign storage volumes to thevirtual volume in response to detecting low levels of space on thevirtual volume.

In addition, in certain embodiments the method may provide incrementalsnapshot data of a source volume in response to a replication operation.This incremental snapshot data may be used in conjunction with abaseline image of the source volume to restore the source volume to aprevious state. Using incremental snapshot data in conjunction with abaseline volume reduces storage space requirements and also reducesbandwidth requirements for replication operations.

Various elements of the present invention are combined into a system formanaging incremental storage including at least one baseline volume, apool of storage volumes, a storage pool management module configured toallocate storage volumes from the pool of storage volumes to one or morevirtual volumes, and an incremental log corresponding to the virtualvolumes that maps a virtual address to a storage address.

Certain embodiments of the system also include a replication module thattransmits replicated data from a source volume to the virtual volume.The replicated data may include incremental snapshot data of the sourcevolume. Furthermore, in selected embodiments the system includes apolicy management module for determining allocation criteria used by thestorage pool management module.

In one configuration, the policy management module provides feedback toa user in response to monitoring reduced space on the virtual volume. Inanother configuration, the policy management module may allow a user tocustomize allocation criteria that define appropriate conditions forallocating and de-allocating storage volumes. Thus, the policymanagement module enables a user such as a system administrator tomanage the storage volumes within the virtual volume.

The present invention dynamically adjusts storage capacity on thevirtual volume, preventing critical applications from having to be takenoff-line to adjust storage capacity. These and other features andadvantages of the present invention will become more fully apparent fromthe following description and appended claims, or may be learned by thepractice of the invention as set forth hereinafter.

BRIEF DESCRIPTION OF THE DRAWINGS

In order that the advantages of the invention will be readilyunderstood, a more particular description of the invention brieflydescribed above will be rendered by reference to specific embodimentsthat are illustrated in the appended drawings. Understanding that thesedrawings depict only typical embodiments of the invention and are nottherefore to be considered to be limiting of its scope, the inventionwill be described and explained with additional specificity and detailthrough the use of the accompanying drawings, in which:

FIG. 1 is a schematic diagram illustrating a typical currently availablestorage system;

FIG. 2 is a schematic diagram illustrating a virtual incremental storagesystem of the present invention;

FIG. 3 is a schematic diagram illustrating one embodiment of a virtualincremental storage apparatus of the present invention;

FIG. 4 is a block diagram illustrating one embodiment of an incrementallog and associated storage volumes of the present invention; and

FIG. 5 is a flow chart diagram illustrating one embodiment of a virtualincremental storage method of the present invention.

FIG. 6 is a text-based diagram depicting one embodiment of a snapshotpool programming interface of the present invention;

FIG. 7 is a text-based diagram depicting one embodiment of a snapshotcriteria data structure of the present invention; and

FIG. 8 is a text-based diagram depicting one embodiment of a snapshotcriteria programming interface of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

It will be readily understood that the components of the presentinvention, as generally described and illustrated in the Figures herein,may be arranged and designed in a wide variety of differentconfigurations. Thus, the following more detailed description of theembodiments of the apparatus, system, and method of the presentinvention, as represented in FIGS. 2 through 5, is not intended to limitthe scope of the invention, as claimed, but is merely representative ofselected embodiments of the invention.

Many of the functional units described in this specification have beenlabeled as modules, in order to more particularly emphasize theirimplementation independence. For example, a module may be implemented asa hardware circuit comprising custom VLSI circuits or gate arrays,off-the-shelf semiconductors such as logic chips, transistors, or otherdiscrete components. A module may also be implemented in programmablehardware devices such as field programmable gate arrays, programmablearray logic, programmable logic devices or the like.

Modules may also be implemented in software for execution by varioustypes of processors. An identified module of executable code may, forinstance, comprise one or more physical or logical blocks of computerinstructions which may, for instance, be organized as an object,procedure, or function. Nevertheless, the executables of an identifiedmodule need not be physically located together, but may comprisedisparate instructions stored in different locations which, when joinedlogically together, comprise the module and achieve the stated purposefor the module.

Indeed, a module of executable code could be a single instruction, ormany instructions, and may even be distributed over several differentcode segments, among different programs, and across several memorydevices. Similarly, operational data may be identified and illustratedherein within modules, and may be embodied in any suitable form andorganized within any suitable type of data structure. The operationaldata may be collected as a single data set, or may be distributed overdifferent locations including over different storage devices, and mayexist, at least partially, merely as electronic signals on a system ornetwork, or alternatively on one or more computer readable media.

Reference throughout this specification to “one embodiment” or “anembodiment” means that a particular feature, structure, orcharacteristic described in connection with the embodiment is includedin at least one embodiment of the present invention. Thus, appearancesof the phrases “in one embodiment” or “in an embodiment” in variousplaces throughout this specification are not necessarily all referringto the same embodiment.

The illustrated embodiments of the invention will be best understood byreference to the drawings, wherein like parts are designated by likenumerals throughout. The following description is intended only by wayof example, and simply illustrates certain selected embodiments ofdevices, systems, and processes that are consistent with the inventionas claimed herein.

FIG. 2 is a schematic diagram illustrating a virtual incremental storagesystem 200 of the present invention. The depicted embodiment of thevirtual incremental storage system 200 includes a bus 202, a host 210, apolicy management module 212, one or more primary volumes 220, areplication module 230, a storage pool management module 240, anincremental log 250, a storage pool 260 of storage volumes 262, and aread module 270. In addition, the virtual incremental storage system 200may include elements of the log-based storage system 100, includingbuses 102, 104, and 106, a file system 112, and a baseline volume 170.

One application of the virtual incremental storage system 200 isreplicating data stored on the primary volumes 220 for backup storagepurposes. As in the storage system 100, the baseline volume 170 receivesbackup data from the primary volumes 220. In addition, the storage pool260 of the present invention may receive backup data from the primaryvolumes 220 via the improved replication module 230 and incremental log250.

A significant advantage of the present invention over currentlyavailable storage systems is that the storage pool 260 may grow orshrink dynamically, according to the storage needs of the system inwhich it is deployed. While depicted in a secondary storage role, thestorage pool 260 and associated modules may also be used for primarystorage.

The host 210 is in communication with the primary volumes 220, thereplication module 230, and the baseline volume 170 through buses 102and 106, respectively. In one embodiment, the host 210 requests datafrom and sends data to the primary volumes 220 via the file system 112,which translates operating system calls to operations on the primaryvolumes 220.

Other modules may also communicate with each other through variousbuses. The replication module 230, the storage pool management module240, and the read module 270 communicate with the incremental log 250via the bus 104, and the incremental log 250 communicates with thestorage pool 260 via the bus 202.

While depicted as components connected by buses to various othercomponents of the virtual incremental storage system 200, thereplication module 230, the storage pool management module 240, the readmodule 270, and the incremental log 250 each may reside on one or morestorage controllers (not shown). Alternatively, the aforementionedmodules and the incremental log 250 may reside in the storage pool 260or on the host 210.

During one manner of backup operations of the present invention, thehost 210 sends data to the baseline volume 170. The data transmitted tothe baseline volume 170 may contain a baseline image of the primaryvolumes 220, corresponding to a replicated image of the primary volumes220 at a particular point in time. In one application, the baselineimage stored on the baseline volume 170 is used during recoveryoperations to restore the primary volumes 220 to a previous state.

In certain embodiments, the host 210 may instruct the replication module230 to conduct replication operations. The replication module 230responds by replicating incremental changes made to the primary volumes220 and storing the replicated changes on the storage pool 260.Incremental changes replicated by the replication module 130 may includeincremental snapshot data of the primary volumes 220.

In one embodiment, the incremental changes stored in the storage pool260 may be accessed at any time through the read module 270. The readmodule 270 provides an alternate access path to data stored on thestorage pool 260, for general use purposes or in case of failure of theprimary volumes 220.

In response to failure of the primary volumes 220, the host 210 mayinitiate recovery operations. During one embodiment of recoveryoperations, the incremental changes stored in the storage pool 260 arecombined with the baseline image stored on the baseline volume 170 torestore the primary volumes 220 to a desired state. In one embodiment,the incremental changes are incremental snapshot data that are combinedwith the baseline image data to reconstruct the primary volumes 220.

Replication operations conducted by the replication module 230 arefacilitated by the incremental log 250, which translates read and writerequests to physical addresses in the storage pool 260. In oneembodiment, the incremental log 250 includes a lookup table or othersimilar data structure that facilitates translating read and writerequests to physical addresses on the storage volumes 262 within thestorage pool 260.

The storage pool 260 includes a set of storage volumes 262. In oneconfiguration, the storage pool management module 240 adds one or moreunassigned storage volumes 262 b to a virtual volume (not shown) inresponse to observing reduced storage space on the virtual volume.Storage volumes 262 that have been assigned to the virtual volume areillustrated with a shaded surface as allocated storage volumes 262 a inFIG. 2.

The storage pool management module 240 may remove one or more allocatedstorage volumes 262 a from the virtual volume in response to observingsurplus storage space on the virtual volume. The removed storage volumes262 become unassigned storage volumes 262 b, which may be allocated to asecond virtual volume. The replication module 230 may replicate datastored on a multiple primary volumes 220 and store the replicated dataon corresponding virtual volumes. Thus, data may be replicated frommultiple primary volumes 220 to multiple virtual volumes, where eachvirtual volume includes one or more allocated storage volume 262 a.

In certain embodiments, the storage pool management module 240 allocatesstorage volumes 262 to the virtual volume according to allocationcriteria determined by policies such as those set by the policymanagement module 212. The policies may include allocation criteria thatdefine appropriate conditions for allocating storage volumes 262 to thevirtual volume. In one embodiment, appropriate conditions include apercentage of storage space utilization of the allocated storage volumes262 a, such as 95% storage space utilization, or the like.

As an example of using allocation criteria, the storage pool managementmodule 240 may detect high storage space utilization on initiallyallocated storage volumes 262 a, and in response allocate a new storagevolume 262 to the virtual volume. Although the new storage volume 262 isallocated to the virtual volume, data may continue to be written to theinitially allocated storage volumes 262 a until storage space on theinitially allocated storage volumes 262 a is completely utilized. In theaforementioned scenario, in response to the initially allocated storagevolumes 262 a reaching 100% storage space utilization, data writingcommences on the new storage volume 262.

Through policies such as those supported by the policy management module212, a user may determine allocation criteria and instruct the storagepool management module 240 to automatically allocate storage volumes 262to the virtual volume in response to allocation criteria fulfillment.Alternatively, the user may use policies to instruct the storage poolmanagement module 240 to prompt the user in response to allocationcriteria fulfillment.

One example of prompting a user includes informing the user thatallocation criteria have been fulfilled and requesting the user to takeaction. The user may take action by instructing the storage poolmanagement module 240 to allocate an unassigned storage volume 262 b tothe virtual volume. Alternatively, the user may take action byterminating replication operations. In addition, the user may use thestorage pool management module 240 to create additional virtual volumesand manage the virtual volumes.

One configuration of the present invention allows data from multipleprimary volumes 220 to be replicated onto one virtual volume. Forinstance, the replication module 230 may replicate data from a firstprimary volume 220, utilizing up to 40% of the virtual volume. Thereplication module 230 may also replicate data from a second primaryvolume 220, utilizing up to 60% of the virtual volume. As the allocatedstorage volumes 262 a in the virtual volume reach a storage spaceutilization determined by the allocation criteria, the storage poolmanagement module 240 allocates additional storage volumes 262 to thevirtual volume.

The depicted virtual incremental storage system 200 provides greaterstorage flexibility than currently available storage systems. Theincreased flexibility is due in part to the unique nature of the storagepool management module 240, which allocates storage volumes 262 to thevirtual volume as required. As a result of this flexibility, a highlyefficient use of storage volumes 262 conserves storage space andprevents potential replication failures.

FIG. 3 is a schematic diagram illustrating one embodiment of a virtualincremental storage apparatus 300 of the present invention. The depictedembodiment of the virtual incremental storage apparatus 300 includes abus 104, a bus 202, a write module 330, a storage pool management module240, an incremental log 250, a storage pool 260, allocated storagevolumes 262 a, unassigned storage volumes 262 b, and a read module 270.

The write module 330, the storage pool management module 240, and theread module 270 communicate with the incremental log 250 via the bus104, and the incremental log 250 communicates with the storage pool 260via the bus 202. The incremental log 250 translates read and writerequests from the write module 330 and the read module 240 to physicaladdresses used to access the storage pool 260. In one embodiment, thewrite module 330 is the replication module 230 and is configured tosupport replication operations in addition to standard write operations.

In one embodiment, the incremental log 250 resides on the storagevolumes 262 stored within the storage pool 260. Changes made to a sourcevolume may be generated and appended to the incremental log 250 by thewrite module 330. In another embodiment, the incremental log 250 is alookup table that maps the changes replicated from the source volume tophysical blocks on the storage volumes 262.

In certain embodiments, the lookup table or similar data structure mapsa virtual address to a block address in the storage pool 260. In oneembodiment, the storage address includes a volume identifier and also ablock identifier, which may be a logical block identifier. The volumeidentifier indicates which storage volume 262 contains the data mappedto by the virtual address, and the block identifier indicates where thedata resides on the storage volume 262.

The read module 270 retrieves data stored within the storage pool 260.The read module 270 may retrieve data in response to failure of a sourcestorage volume, or the read module 270 may retrieve data for general useby an application. In addition, the read module 270 may retrieve datafrom the storage volumes 262 in order to send the data to a backupdevice, such as a tape system or the like.

The storage pool management module 240 may selectively allocate storagevolumes 262 to the virtual volume. In one embodiment, the storage poolmanagement module 240 allocates storage volumes 262 to the virtualvolume in response to observing reduced space on the virtual volume.Additionally, the storage pool management module 240 may also removestorage volumes 262 from the virtual volume in response to observing asurplus of storage space on the virtual volume.

In one embodiment, the removed storage volumes 262 become unassignedstorage volumes 262 b. The storage pool management module 240 mayallocate one or more of the unassigned storage volumes 262 b to othervirtual volumes. Consequently, different storage volumes 262 from thestorage pool 260 may be allocated to separate virtual volumes.

The storage pool 260 of the present invention may be a RAID array of astorage device corresponding to the storage volumes 262. In oneembodiment utilizing a RAID array, the incremental log 250 maintains astripe map that facilitates translating read and write requests tophysical data blocks on the storage volumes 262. Using a RAID array forthe storage pool 260 allows for high performance reading and writing,large storage capacities, data redundancy, error correcting capability,and ease of implementation.

The virtual incremental storage apparatus 300 of the present inventionminimizes waste of storage space. In addition, the virtual incrementalstorage apparatus 300 provides a dynamically sizable storage pool 260for receiving replicated data, such as incremental snapshot data of theprimary volumes 220 and the like.

FIG. 4 is a block diagram illustrating one embodiment of an incrementallog and associated storage volumes 400 of the present invention. Thedepicted embodiment of the incremental log and associated storagevolumes 400 includes a set of allocated storage volumes 262 a, a lookuptable 410, a virtual address 412, a storage address 414, a blockidentifier 416, a volume identifier 418, and one or more storage devices426.

In the depicted embodiment, the incremental log 250 includes a lookuptable 410. The lookup table 410 may be stored in RAM (random accessmemory), or alternatively, the lookup table may be stored in the storagepool 260. In one embodiment, the lookup table 410 is part of a stripemap, and the storage pool 260 constitutes a RAID array of storagedevices.

One form of the lookup table 410 contains entries corresponding tovirtual addresses 412 and storage addresses 414. Individual storageaddresses 414 may correspond to individual virtual addresses 412. Thecorrespondence between storage addresses 414 and virtual addresses 412constitutes a mapping of virtual addresses 412 to physical data blocksstored at the storage addresses 414.

In the depicted embodiment, each storage address 414 includes a blockidentifier 416 and a volume identifier 418. The volume identifier 418indicates which allocated storage volume 262 a contains data pointed toby the virtual address 412, and the block identifier 416 indicates whichlogical block on a storage device 426 within the allocated storagevolume 262 a contains the data. In the RAID array embodiment, where thestorage pool 260 constitutes a RAID array, the volume identifier 418 maybe a logical unit number (LUN) or virtual logical unit number (VLUN).

In the depicted Figure, a few examples of mappings are illustrated. In afirst example, a virtual address 412 with value 0x500 maps to a volumeidentifier 418 with value 0 and block identifier 416 with value 0x100.The volume identifier 418 with value 0 points to an allocated storagevolume 262 a also having value 0. In addition, the block identifier 416with value 0x100 points to a physical address of 0x100 on a particularstorage device 426 within the storage volume 262 a that has a value of0.

In another example of a mapping, a virtual address with value 0x510 mapsto a volume identifier 418 with value 1 and block identifier with value0x100. The volume identifier 418 with value 1 points to an allocatedstorage volume 262 a also having value 1, and the block identifier 416with value 0x100 points to a physical address of 0x100 on a particularstorage device 426 within the storage volume 262 a that has a value of1.

In each aforementioned example, the block identifier 416 had a value of0x100; however, each block identifier 416 pointed to different data ondifferent allocated storage volumes 262 a due to the to differentstorage volumes 262 a referenced by the volume identifiers 418. Theseexamples illustrate the low complexity of the incremental log andassociate storage volumes 400, which enables low-complexity read andwrite operations to different storage volumes 262 within the storagepool 260.

FIG. 5 is a flow chart diagram illustrating one embodiment of a virtualincremental storage method 500 of the present invention. The depictedembodiment of the virtual incremental storage method 500 includes areplication operation test 510, a replicate changes step 515, a reducedspace test 520, an allocate storage volume step 525, a write operationtest 530, an append storage mapping step 535, a read operation test 540,a map virtual address step 545, and an exit requested test 550. Thevirtual incremental storage method 500 may be used to reduce storagewaste in an incremental storage subsystem such as the storage pool 260depicted in FIGS. 2 and 3.

The depicted embodiment of the virtual incremental storage method 500begins with the replication operation test 510. The replicationoperation test 510 determines whether replication operations on a sourcevolume such as a primary volume 220 have been initiated. In response todetermining that replication operations have been initiated, the methodproceeds to the replicate changes step 515.

In one embodiment of the replicate changes step 515, a module such asthe replication module 230 replicates data from a source volume such asthe primary volume 220 and sends the replicated data to the storage pool260. Replication operations may include snapshot operations, or thelike. After providing replicated changes of the source volume 515, themethod loops back to the beginning of the virtual incremental storagemethod 500.

The reduced space test 520 determines whether additional storage volumes260 need to be allocated to the virtual volume. In one embodiment, amodule such as the storage pool management module 240 determines whetheradditional storage volumes 260 need to be allocated to the virtualvolume. In response to the reduced space test 520 determining thatadditional storage devices are required, the method proceeds to theallocate storage volume step 525.

The allocate storage volume step 525 provides additional storage spaceby adding a storage volume to the virtual volume. In one embodiment, thestorage volumes 262 are allocated to the virtual volume by the storagepool management module 240. The method continues by looping back to thebeginning of the virtual incremental storage method 500.

The write operation test 530 determines whether a write operation hasbeen initiated. In response to the write operation test 530 determiningthat a write operation has been initiated, the method proceeds to theappend storage mapping step 535. In one embodiment, the append storagemapping step 535 includes appending a storage address 414 comprising avolume identifier 418 and block identifier 416 to a lookup table 410, astripe map, or a similar data structure. The storage address 414 pointsto physical data stored on the storage volumes 262. In response tocompletion of the append storage mapping step 535, the method continuesby looping back to the beginning of the virtual incremental storagemethod 500.

The read operation test 540 determines whether a read operation has beeninitiated. In one embodiment, the read module 270 performs readoperations on the storage volumes 262. In another embodiment, thestorage pool management module 240 performs read operations on thestorage volumes 262 to determine storage capacity utilization on thestorage volumes 262.

In response to determining that a read operation has been initiated themethod proceeds to the map virtual address step 545. As one example ofthe map virtual address step 545, the read module 270 sends requests tothe incremental log 250, which translates the requests into physicaladdresses on the storage volume. By translating the requests intophysical addresses, the incremental log 250 enables the read module 270to retrieve data stored at the physical addresses.

The exit requested test 550 determines whether a request to exit themethod 500 has been issued for example by control process associatedwith a storage subsystem. If a request has been issued the method 500terminates 560. If no request has been issued the method continues bylooping back to the beginning of the virtual incremental storage method500.

The virtual incremental storage method 500 of the present invention maybe used to continuously monitor storage volumes such as the storagevolumes 262 to determine whether additional space is required on avirtual volume. By continuously monitoring the storage volumes anddynamically allocating unassigned storage volumes to the virtual volumeas needed, the virtual incremental storage method 500 of the presentinvention improves storage capacity utilization over currently availablemethods.

In certain embodiments of the present invention the storage poolmanagement module 240 may provide one or more programming interfaces tofacilitate management of the storage pool 260. For example, in oneembodiment, a user creates system volumes (logical volumes) of any sizeand RAID level and adds them to the storage pool 260. Additional stateinformation may be added to standard state information associated withlogical drives to facilitate managing the volumes within the storagepool 260.

Referring to FIG. 6, a snapshot pool programming interface 600 includesan addLogicalDrive function 610, a removeTargetVolume function 620, areserveTargetVolume function 630, a releaseTargetVolume function 640,and a selectTargetVolume function 650. The snapshot pool programminginterface 600 facilitates managing snapshot resources and simplifiesprogramming complexity.

The addLogicalDrive function 610 adds a logical drive to the storagepool 260. In certain embodiments, adding the logical drive to thestorage pool prevents access to the drive by a host unless the drive hasbeen reserved and used in a snapshot relationship. In one embodiment,any storage subsystem state information associated with the logicaldrive must have been previously initialized in the storage subsystem forsnapshot target pool use.

Referring to FIG. 7, a snapshot criteria data structure 700 includes acriteria name 710, a source drive 720, a target drive 730, a start block740, an end block 750, a raid level 760, and a set of snapshot flags770. The snapshot criteria data structure 700 may be used to simplifyselection of target volumes from a storage pool.

The criteria name 710 specifies a name for the criteria defined in thesnapshot criteria data structure 700. In one embodiment, a sourcelogical drive number and name are concatenated in order to uniquelyidentify snapshot criteria. The source drive 720 identifies the logicaldrive to be used as a source for snapshot operations. In one embodiment,the source drive is identified using a logical drive number. Similarly,the target drive 730 identifies the logical drive to be used as a targetfor snapshot operations. In one embodiment, the target drive 730 may bespecified as a null identifier indicating that a particular is notrequired. A null identifier may be overwritten upon selection of anactual target volume.

The start block 740 and the end block 750 facilitate optionalspecification of a starting and ending logical block address (LBA) onthe source volume for a snapshot operation. The raid level 760 indicatesacceptable RAID levels for the snapshot target. In one embodiment, theraid level 760 is a bit field with each bit representing a distinct RAIDlevel.

The snapshot flags 770 specify options for the snapshot operation. Forexample, the Full/Partial! flag 770 a indicates whether a full snapshotor partial (i.e. incremental) is to be conducted. The background flag770 b indicates whether the snapshot is to be conducted in thebackground. The block address flag 770 c indicates whether the snapshotis to be limited to the blocks specified by the start block 740 and theend block 750.

The snapshot criteria data structure 700 may be used to specify theinformation necessary to select a snapshot target drive from thesnapshot pool when necessary. In certain embodiments, a single systemdrive may have more than one snapshot criteria defined. In this manner,different criteria can be used for a backup volume than for a testvolume. In one embodiment, default criteria may be assigned to a systemdrive. The use of default criteria facilitates using a particular sourcedrive without specific knowledge of the drive attributes.

By using the snapshot criteria concept, the criteria used for a snapshotoperation may be specified prior to establishing a snapshot pool. Also,the snapshot targets need not be explicitly defined by an applicationrequesting snapshot services. In one embodiment, the snapshot criteriafor a given operation are created and stored on a RAID controller alongwith other configuration data associated with storage devices. Storingthe snapshot criteria on a RAID controller facilitates accessingsnapshot criteria multiple hosts.

Referring to FIG. 8, a snapshot criteria programming interface 800includes a createCriteria function 810, a deleteCriteria function 820, agetCriteria function 830, a getNextCriteria function 840, and anexecuteSnapshot function 850. The snapshot criteria programminginterface 800 simplifies specifying and executing snapshot operations.

The createCriteria function 810 creates new snapshot criteria for a(source) system drive or volume. In one embodiment, a unique name isspecified for the criteria and the criteria conforms to the criteriadata structure shown in FIG. 7. The deleteCriteria function 820 deletescriteria associated with a system drive or volume. The getCriteriafunction 830 returns the snapshot criteria associated with a specifiedcriteria name for a specified system drive. The getNextCriteria function840 may be used to sequence through all criteria defined for a specifiedsystem drive or volume.

The executeSnapshot function 850 executes a snapshot operationconforming to the criteria associated with a specified criteria nameassociated with a system drive. In one embodiment, the executeSnapshotfunction 850 allocates a snapshot target from the snapshot pool 260,establishes the appropriate snapshot relationship between source andtarget devices, flushes the operating system files associated withsystem drive, initiates the snapshot operation, and reattaches thesource device to the host system.

The present invention greatly simplifies data backup and other datareplication operations.

The present invention may be embodied in other specific forms withoutdeparting from its spirit or essential characteristics. The describedembodiments are to be considered in all respects only as illustrativeand not restrictive. The scope of the invention is, therefore, indicatedby the appended claims rather than by the foregoing description. Allchanges which come within the meaning and range of equivalency of theclaims are to be embraced within their scope.

1. An apparatus for managing incremental storage, the apparatuscomprising: a policy management module that sets a storage managementpolicy for storage capacity of an existing incremental backup virtualvolume, wherein the existing incremental backup virtual volume storesincremental storage data from an incremental storage operation of aprimary volume and the existing incremental backup virtual volumecomprises at least one storage volume of a storage pool; a storage poolmanagement module that monitors available storage capacity of theexisting incremental backup virtual volume and to change the storagecapacity of the existing incremental backup virtual volume in responseto the storage management policy and the available storage capacity,wherein changing the storage capacity comprises dynamically allocatingand de-allocating a storage volume of the storage pool to the existingincremental backup virtual volume in response to the change to thestorage capacity; and an incremental log corresponding to the existingincremental backup virtual volume, wherein the incremental log maps avirtual address of the existing incremental backup virtual volumeassigned to the incremental storage data to a physical storage addressof the at least one storage volume of the storage pool wherein thepolicy management module and the storage module comprise at least one oflogic hardware elements and executable code, the executable code storedon one or more computer readable media.
 2. The apparatus of claim 1,wherein the physical storage address comprises a volume identifier. 3.The apparatus of claim 1, wherein the storage pool management modulefurther allocates and de-allocates a portion of a storage volume to theexisting incremental backup virtual volume.
 4. The apparatus of claim 1,further comprising a read module that reads data stored in the existingincremental backup virtual volume by way of a data path independent froma data path used to store the incremental storage data.
 5. The apparatusof claim 1, wherein the storage pool management module allocates andde-allocates storage volumes without user input.
 6. The apparatus ofclaim 1, wherein the storage pool management module further allocates asecond storage volume to the existing incremental backup virtual volumein response to a reduction in available space on a first storage volume.7. The apparatus of claim 1, wherein the storage pool comprises at leastone storage volume corresponding to a redundant set of storage devices.8. The apparatus of claim 7, wherein the redundant set of storagedevices comprises a RAID (Redundant Array of Independent Disks) storagearray.
 9. The apparatus of claim 1, wherein the incremental logcomprises a lookup table.
 10. The apparatus of claim 1, wherein thestorage pool management module further increases the storage capacity inresponse to the available storage capacity increasing above a firststorage capacity threshold and to decrease the storage capacity inresponse to the available storage capacity decreasing below a secondstorage capacity threshold.
 11. The apparatus of claim 10, wherein thestorage pool management module further de-allocates storage volumeswherein the de-allocated storage volumes are available for allocation toa virtual volume unrelated to the existing incremental backup virtualvolume.
 12. A system for managing incremental storage, the systemcomprising: a primary volume; a baseline volume that stores a baselinebackup copy of data on the primary volume; a storage pool with at leastone storage volume that stores incremental storage data from anincremental storage operation of the primary volume in response tochanges in data stored on the primary volume after storing the baselinebackup copy on the baseline volume, wherein the storage volume isallocated as an existing incremental backup virtual volume; a policymanagement module that sets a storage management policy for availablestorage capacity of the existing incremental backup virtual volume; astorage pool management module that monitors available storage capacityof the existing incremental backup virtual volume and changes thestorage capacity of the existing incremental backup virtual volume inresponse to the storage management policy and the available storagecapacity, wherein changing the storage capacity comprises dynamicallyallocating and de-allocating a storage volume of the storage pool to theexisting incremental backup virtual volume in response to the change tothe storage capacity; and an incremental log corresponding to theexisting incremental backup virtual volume, wherein the incremental logmaps a virtual address of the existing incremental backup virtual volumeassigned to the incremental storage data to a physical storage addressof the at least one storage volume of the storage pool.
 13. The systemof claim 12, further comprising a replication module that transmits theincremental data from the primary volume to the existing incrementalbackup virtual volume.
 14. The system of claim 13, wherein theincremental data comprises incremental snapshot data of the primaryvolume.
 15. The system of claim 12, wherein the primary volume comprisesa plurality of primary volumes; the existing incremental backup virtualvolume comprises an existing incremental backup virtual volumecorresponding to each primary volume, each existing incremental backupvirtual volume comprising at least one storage volume of the storagepool; the incremental log comprises an incremental log corresponding toeach existing incremental backup virtual volume; and the storage poolmanagement module monitors available capacity of each existingincremental backup virtual volume and allocates and de-allocates storagevolumes for each existing incremental backup virtual volume from thestorage pool.
 16. The system of claim 12, wherein the storage modulefurther allocates and de-allocates a portion of a storage volume to theexisting incremental backup virtual volume.
 17. The system of claim 12,wherein the baseline volume is part of the storage pool.
 18. The systemof claim 12, wherein the policy management module resides in a host. 19.A computer readable storage medium comprising computer readable programcode for managing incremental storage, wherein the program code:monitors available storage capacity of an existing incremental backupvirtual volume; changes a storage capacity of the existing incrementalbackup virtual volume in response to a storage management policy and theavailable storage capacity, wherein the existing incremental backupvirtual volume stores incremental storage data from an incrementalstorage operation of a primary volume and the existing incrementalbackup virtual volume comprises at least one storage volume of a storagepool, the at least one storage volume allocated to the existingincremental backup virtual volume; dynamically allocates andde-allocates a storage volume of the storage pool to the existingincremental backup virtual volume in response to a change to the storagecapacity of the existing incremental backup virtual volume; and maps inan incremental log corresponding to the existing incremental backupvirtual volume a virtual address of the existing incremental backupvirtual volume assigned to the incremental storage data to a physicalstorage address of the at least one storage volume of the storage pool.20. The computer readable storage medium of claim 19, wherein theprogram code further provides incremental snapshot data of the primaryvolume in response to a replication operation.
 21. The computer readablestorage medium of claim 19, further comprising reading data stored inthe storage pool by way of a data path independent from a data path usedto store incremental data.